Effect of foreign accent on speech recognition in the NATO n-4 corpus
نویسندگان
چکیده
We present results from a series of 151 speech recognition experiments based on the N4 corpus of accented English speech, using a small vocabulary recognition system. These experiments looked at the impact of foreign accent on speech recognition, both within non-native accented English and across different accents, with particular interest in using context free grammar technology to improve callsign identification. Results show that phonetic models built from foreign accented English are not less accurate than native ones at decoding novel data with the same accent. Cross accent recognition experiments show that phonetic models from a given accent group were 1.8 times less accurate in recognizing speech from a different accent. In contrast to other attempts to perform accurate recognition across accents, our approach of training very compact, accent-specific models (less than 3 hours of speech) provided very accurate results without the arduous task of adapting a phonetic dictionary to every accent.
منابع مشابه
Foreign accent identification based on prosodic parameters
In this paper we propose an automatic approach for foreign accent identification. Knowledge of the speaker’s origin allows to adapt acoustic models for non-native speech recognition. In this study, we use a statistical approach based on prosodic parameters. This approach relies on the fact that prosody is different between languages, and has been done within the framework of the HIWIRE (Human I...
متن کاملFactors affecting i-vector based foreign accent recognition: A case study in spoken Finnish
I-vector based recognition is a well-established technique in state-of-the-art speaker and language recognition but its use in dialect and accent classification has received less attention. In this work, we extensively experiment with the spectral feature based i-vector system on Finnish foreign accent recognition task. Parameters of the system are initially tuned with the CallFriend corpus. Th...
متن کاملAutomatic Detection of Foreign Accent for Automatic Speech Recognition
Recognition of foreign accented speech remains among the most difficult tasks in automatic speech recognition. It was observed that using models trained on foreign data together with native models improves the recognition for speakers with foreign accent. However such an approach degrades the recognition performances on native speakers. In order to avoid such performance degradation the degree ...
متن کاملDialect analysis and modeling for automatic classification
In this paper, we present our recent work in the analysis and modeling of speech under dialect. Dialect and accent significantly influence automatic speech recognition performance, and therefore it is critical to detect and classify non-native speech. In this study, we consider three areas that include: (i) prosodic structure (normalized f0, syllable rate, and sentence duration), (ii) phoneme a...
متن کاملMulti-accent and accent-independent non-native speech recognition
In this article we present a study of a multi-accent and accentindependent non-native speech recognition. We propose several approaches based on phonetic confusion and acoustic adaptation. The goal of this article is to investigate the feasibility of multi-accent non-native speech recognition without detecting the origin of the speaker. Tests on the HIWIRE corpus show that multi-accent pronunci...
متن کامل